Constrained Submodular Minimization Towards Missing Labels and Class Imbalance in Multi-label Learning
نویسندگان
چکیده
Copyright c © 2016, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved. edges among instances EX for each class (building connections between the entries within the same row, for every row of Z); last, we copy the edges among classes EC for each instance (building connections between the entries within the same column, for every column of Z). di 6= ∑mn j W(i, j): if (i, j) ∈ EX , then di = dX (̂i), with î being the instance index (i.e., the column index of Z) corresponding to the node i in G; similarly, if (i, j) ∈ EC , then di = dC (̂i). That means we normalize W(i, j) by the sum of instance-level neighbors (in the same column) or class-level neighbors (in the same row), rather than the sum of all neighbors. As a result, this problem is a partially normalized graph-cut problem. Interestingly, the formulation in (3) is exactly the same as that of the standard GSSL problem (Zhu 2006). The only difference is that L is not a normalized Laplacian matrix in (3). Please see Lemma 1.
منابع مشابه
Constrained Submodular Minimization for Missing Labels and Class Imbalance in Multi-label Learning
In multi-label learning, there are two main challenges: missing labels and class imbalance (CIB). The former assumes that only a partial set of labels are provided for each training instance while other labels are missing. CIB is observed from two perspectives: first, the number of negative labels of each instance is much larger than its positive labels; second, the rate of positive instances (...
متن کاملExploiting Associations between Class Labels in Multi-label Classification
Multi-label classification has many applications in the text categorization, biology and medical diagnosis, in which multiple class labels can be assigned to each training instance simultaneously. As it is often the case that there are relationships between the labels, extracting the existing relationships between the labels and taking advantage of them during the training or prediction phases ...
متن کاملTowards Class-Imbalance Aware Multi-Label Learning
In multi-label learning, each object is represented by a single instance while associated with a set of class labels. Due to the huge (exponential) number of possible label sets for prediction, existing approaches mainly focus on how to exploit label correlations to facilitate the learning process. Nevertheless, an intrinsic characteristic of learning from multi-label data, i.e. the widely-exis...
متن کاملLarge-scale Multi-label Learning with Missing Labels
The multi-label classification problem has generated significant interest in recent years. However, existing approaches do not adequately address two key challenges: (a) scaling up to problems with a large number (say millions) of labels, and (b) handling data with missing labels. In this paper, we directly address both these problems by studying the multi-label problem in a generic empirical r...
متن کاملTowards Label Imbalance in Multi-label Classification with Many Labels
In multi-label classification, an instance may be associated with a set of labels simultaneously. Recently, the research on multi-label classification has largely shifted its focus to the other end of the spectrum where the number of labels is assumed to be extremely large. The existing works focus on how to design scalable algorithms that offer fast training procedures and have a small memory ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015